Predicting Publication Inclusion for Diagnostic Accuracy Test Reviews Using Random Forests and Topic Modelling

نویسندگان

  • Allard J. van Altena
  • Sílvia Delgado Olabarriaga
چکیده

Finding all relevant publications to perform a systematic review can be a time consuming task, especially in the field of diagnostic test accuracy. Therefore, the CLEF eHealth lab ‘technologically assisted reviews in empirical medicine’ was established to create a basis of comparison between various methods. In this paper we describe a method submitted to the lab. This method consists of a topic model used to extract features and a random forest to classify the relevant papers. Classifier performance shows and average decrease of 33.3% in workload (i.e., documents to read) when aiming for a 95% recall and 24.9% for 100% recall. However, there is a large variety in workload reduction (79.3% to 0.9%) between the diagnostic test accuracy reviews.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A methodological review of how heterogeneity has been examined in systematic reviews of diagnostic test accuracy.

OBJECTIVES To review how heterogeneity has been examined in systematic reviews of diagnostic test accuracy studies. DATA SOURCES Centre for Reviews and Dissemination's Database of Abstracts of Reviews of Effects (DARE). REVIEW METHODS Systematic reviews that evaluated a diagnostic or screening test by including studies that compared a test with a reference test were identified from DARE. Re...

متن کامل

Assessing variability in results in systematic reviews of diagnostic studies

BACKGROUND To describe approaches used in systematic reviews of diagnostic test accuracy studies for assessing variability in estimates of accuracy between studies and to provide guidance in this area. METHODS Meta-analyses of diagnostic test accuracy studies published between May and September 2012 were systematically identified. Information on how the variability in results was investigated...

متن کامل

A mixed effect model for bivariate meta-analysis of diagnostic test accuracy studies using a copula representation of the random effects distribution.

Diagnostic test accuracy studies typically report the number of true positives, false positives, true negatives and false negatives. There usually exists a negative association between the number of true positives and true negatives, because studies that adopt less stringent criterion for declaring a test positive invoke higher sensitivities and lower specificities. A generalized linear mixed m...

متن کامل

Fault Locating in High Voltage Transmission Lines Based on Harmonic Components of One-end Voltage Using Random Forests

In this paper, an approach is proposed for accurate locating of single phase faults in transmission lines using voltage signals measured at one-end. In this method, harmonic components of the voltage signals are extracted through Discrete Fourier Transform (DFT) and are normalized by a transformation. The proposed fault locator, which is designed based on Random Forests (RF) algorithm, is train...

متن کامل

An overview of meta-analyses of diagnostic tests in infectious diseases.

This review summarizes meta-analyses evaluating the accuracy of diagnostic tests for infectious diseases. Systematic searches identified 55 meta-analyses that satisfied inclusion criteria of reporting diagnostic accuracy of an index test compared with a reference test. All reviews were assessed for methods and reporting. The overall assessment underlined problems in several key steps: reporting...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017